Add `V1_1` to `NTVTR` #830

addie9800 · 2025-11-10T19:25:10Z

No description provided.

MaxDall · 2025-11-10T23:27:59Z

src/fundus/publishers/tr/ntvtr.py

+    class V1_1(V1):
+        VALID_UNTIL = datetime.date.today()
+
+        _paragraph_selector = XPath("//div[contains(@class, 'content')]/p[text()]")
+        _summary_selector = XPath("//h2")
+        _subheadline_selector = XPath("//div[contains(@class, 'content')]/p[not(text()) and strong]")
+
+        _topics_selector = XPath("(//ul[contains(@class, 'text-[#3D619B]')])[1]/li")
+
+        @attribute
+        def body(self) -> Optional[ArticleBody]:
+            return extract_article_body_with_selector(
+                self.precomputed.doc,
+                paragraph_selector=self._paragraph_selector,
+                summary_selector=self._summary_selector,
+                subheadline_selector=self._subheadline_selector,
+            )
+
+        @attribute
+        def topics(self) -> List[str]:
+            return generic_topic_parsing(
+                strip_nodes_to_text(self._topics_selector(self.precomputed.doc), join_on=","),
+                substitution_pattern=re.compile(r"-\s*$"),
+                delimiter=",",
+            )
+
+        @attribute
+        def images(self) -> List[Image]:
+            return image_extraction(
+                doc=self.precomputed.doc,
+                paragraph_selector=self._paragraph_selector,
+                upper_boundary_selector=CSSSelector("h1"),
+                lower_boundary_selector=XPath("(//img[@alt='Google Play'])[1]"),
+                image_selector=XPath("//div[@property='articleBody']//img[not(@fetchpriority='auto')]"),
+                author_selector=XPath("./ancestor::div[contains(@class,'relative') and (picture or img)]/div"),
+            )


That looks like it might be worth opening a new major version
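
For readers unfamiliar with the `topics` attribute in the diff above, here is a minimal sketch of what the `-\s*$` substitution pattern combined with the `","` delimiter appears to be for: stripping a trailing hyphen from each topic after splitting. The helper `split_topics` and the sample string are hypothetical stand-ins for illustration. This is not fundus's `generic_topic_parsing`, whose actual behavior may differ.

```python
import re
from typing import List, Optional, Pattern


def split_topics(
    raw: str,
    delimiter: str = ",",
    substitution_pattern: Optional[Pattern[str]] = None,
) -> List[str]:
    """Hypothetical stand-in for the topic parsing step, for illustration only."""
    topics: List[str] = []
    for part in raw.split(delimiter):
        # Remove whatever the pattern matches (here: a trailing hyphen).
        if substitution_pattern is not None:
            part = substitution_pattern.sub("", part)
        part = part.strip()
        # Skip empty entries and duplicates, keeping first-seen order.
        if part and part not in topics:
            topics.append(part)
    return topics


# Same pattern as in the diff: a hyphen followed by optional whitespace at the end.
TRAILING_HYPHEN = re.compile(r"-\s*$")

# Assumed input shape: list items joined on "," where some end in a separator hyphen.
sample = "Ekonomi -,Dünya - ,Spor"
print(split_topics(sample, substitution_pattern=TRAILING_HYPHEN))
```

Because the pattern is anchored with `$`, only a hyphen at the very end of an item is removed; a hyphen inside a topic such as `son-dakika` is left alone.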

Add V1_1 to NTVTR

729853d

addie9800 requested a review from MaxDall November 10, 2025 19:25

remove removesuffix()

067dbb2

MaxDall reviewed Nov 10, 2025


3 participants
